Probability distribution of dependency distance

نویسنده

  • Haitao Liu
چکیده

This paper investigates probability distributions of dependency distances in six texts extracted from a Chinese dependency treebank. The fitting results reveal that the investigated distribution can be well captured by the right truncated Zeta distribution. In order to restrict the model only to natural language, two samples with randomly generated governors are investigated. One of them can be described e.g. by the Hyperpoisson distribution, the other satisfies the Zeta distribution. The paper also presents a study on sequential plot and mean dependency distance of six texts with three analyses (syntactic, and two random). Of these three analyses, syntactic analysis has a minimum (mean) dependency distance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The effects of sentence length on dependency distance, dependency direction and the implications-Based on a parallel English-Chinese dependency treebank

Dependency distance is closely related to human working memory capacity, but is also influenced by other non-cognitive factors. Studies of dependency distance contribute to the understanding of the universalities and peculiarities of languages as well as human cognitive processes in language. Forty two sentence sets were selected from a parallel English–Chinese dependency treebank to examine th...

متن کامل

A framework for medical image retrieval using merging-based classification with dependency probability-based relevance feedback

Content-based image retrieval (CBIR) systems are used to retrieve relevant images from large-scale databases. In this paper, a framework for the image retrieval of a large-scale database of medical X-ray images is presented. This framework is designed based on query image classification into several prespecified homogeneous classes. Using a merging scheme and an iterative classification, the ho...

متن کامل

The Zografos–Balakrishnan-log-logistic Distribution

Tthe Zografos–Balakrishnan-log-logistic (ZBLL) distribution is a new distribution of three parameters that has been introduced by Ramos et el. [1], and They presented some properties of the new distribution such as its probability density function, The cumulative distribution function, The  moment generating function, its hazard (failure) rate function, quantiles and moments, Rényi and Shannon ...

متن کامل

Shear-flow-enhanced barrier crossing.

We consider a single Brownian particle confined in a double well potential (DWP) and investigate its response to a linear shear flow by means of the probability density and current determined via numerical solution of the Fokker-Planck equation. Besides a shear-dependent distortion of the probability distribution, we find that the associated current crossing the potential barrier exhibits a con...

متن کامل

Analysis of Dependency Structure of Default Processes Based on Bayesian Copula

One of the main problems in credit risk management is the correlated default. In large portfolios, computing the default dependencies among issuers is an essential part in quantifying the portfolio's credit. The most important problems related to credit risk management are understanding the complex dependence structure of the associated variables and lacking the data. This paper aims at introdu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Glottometrics

دوره 15  شماره 

صفحات  -

تاریخ انتشار 2007